utf-8-history.txt
cl.cam.ac.uk·3d·
Discuss: Lobsters
🔤Character Encoding
The Landscape of Arabic Large Language Models
cacm.acm.org·12h
🤖Grammar Induction
Reading bits in far too many ways (part 1)
fgiesen.wordpress.com·11h·
Discuss: Hacker News
🔢Bitwise Algorithms
Transforming Images into Insights: The Role of OCR in AI Workflows
infosecwriteups.com·1h
👁️OCR Enhancement
Tokenization Strategies for Low-Resource Agglutinative Languages in Word2Vec: Case Study on Turkish and Finnish
arxiv.org·22h
📝Text Parsing
Text Handling Challenges in MHFS Development
computoid.com·1d·
Discuss: Hacker News
🔤Unicode Normalization
An Overview of Digital Tools and Resources for East Asian Studies Reviewed by The Digital Orientalist Members
digitalorientalist.com·13h
📜Digital Philology
Can Your Team Self-Organize?
thefiddler.substack.com·14h·
Discuss: Substack
🌳Trie Structures
Daiichi Sankyo Company, Limited (DSNKY) Discusses On WCLC 2025 Highlights (Transcript)
seekingalpha.com·10h
🎧WAV Metadata
What's The Weirdest Way To Say "River"?
feelingthestones.com·7h·
Discuss: Hacker News
🇨🇳Chinese Computing
Has anyone used Sucuri?
forums.anandtech.com·9h
🌐DNS Security
Yet Another c-bata/go-prompt - Built from scratch to solve the maintenance problem
github.com·13h
📟Terminals
ECMAScript 2025 Language Specification
262.ecma-international.org·15h·
Discuss: Hacker News
🎯Gradual Typing
Enter Sandbox 30: Static Analysis gone wrong
hexacorn.com·4h
🔍Binary Forensics
Computer Networking: A Top-Down Approach (9th Edition)
pearson.com·19h·
Discuss: Hacker News
📡Network Protocol Design
How Python Type Hints Transform Code Quality and Reduce Bugs in Modern Development
dev.to·1d·
Discuss: DEV
🔬Refinement Types
Testinlägg 3
informationsforvaltning.com·17h
📦Deflate
Matcha prices are going through the roof
sazentea.com·1d·
Discuss: Hacker News
🔤Character Encoding
Whose Punctuation Is More Human: Yours or A.I.’s?
nytimes.com·1d·
Discuss: Hacker News
📝Punctuation Engines